This paper examines the feasibility of discovering "title-like" terms using a decision tree classifier from the document. The premise of discovering title-like terms is that title terms and title-like terms should behave similarly in the document. This behavior is characterized by a set of distributional and linguistic features. By training the classifier to observe the behavior of title terms in a balanced manner using 25,000 titles in Reuters articles, other terms with similar behavior would also be discovered. Based on 5000 unseen titles, the recall of title terms was 83%, similar to the manual identification of title terms. The precision of finding title terms is low (i.e., 32%) because some non-title but title-like terms should have be...
In this paper, we present an analysis based on linguistic and typographic features that allows for t...
Abstract The purpose of automatic title generation is to understand a document and to summarize it w...
Abstract. This paper discusses fundamental issues involved in word selection for title generation. W...
In this paper, we show how we can learn to select good words for a document title. We view the probl...
To digest tremendous documents efficiently, people often resort to their titles, which normally prov...
Automatic titling of text documents is an essential task for several applications (automatic heading...
We propose an intelligent document title classification agent based on a theory of information infer...
We propose an intelligent document title classification agent based on a theory of information infe...
We propose an intelligent document title classification agent based on a theory of information infer...
The title of a document has two roles, to give a compact summary and to lead the reader to read the ...
Document clustering is a powerful technique to detect topics and their relations for information bro...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web p...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Title...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Title...
Abstract. This paper discusses fundamental issues involved in word selection for title generation. W...
In this paper, we present an analysis based on linguistic and typographic features that allows for t...
Abstract The purpose of automatic title generation is to understand a document and to summarize it w...
Abstract. This paper discusses fundamental issues involved in word selection for title generation. W...
In this paper, we show how we can learn to select good words for a document title. We view the probl...
To digest tremendous documents efficiently, people often resort to their titles, which normally prov...
Automatic titling of text documents is an essential task for several applications (automatic heading...
We propose an intelligent document title classification agent based on a theory of information infer...
We propose an intelligent document title classification agent based on a theory of information infe...
We propose an intelligent document title classification agent based on a theory of information infer...
The title of a document has two roles, to give a compact summary and to lead the reader to read the ...
Document clustering is a powerful technique to detect topics and their relations for information bro...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web p...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Title...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Title...
Abstract. This paper discusses fundamental issues involved in word selection for title generation. W...
In this paper, we present an analysis based on linguistic and typographic features that allows for t...
Abstract The purpose of automatic title generation is to understand a document and to summarize it w...
Abstract. This paper discusses fundamental issues involved in word selection for title generation. W...